On not recognizing disfluencies in dialogue

Authors

  • Robin J. Lickley
  • Ellen Gurman Bard
Abstract

This paper tests the hypothesis that listeners miss disfluencies or fail to transcribe them accurately because disfluencies interfere with the normal relationship between speech sound and linguistic context in human spoken word recognition. In a word-level gating experiment, 16 listeners heard a total of 56 disfluent utterances selected from a corpus of spontaneous speech, 56 length-matched fluent controls, and 56 fluent foils. The proportion of words never recognized was greater in disfluent utterances than in controls. The failures clustered around the point where the disfluency interrupted the utterance, occurring particularly within the reparanda, but were not found at corresponding locations in uninterrupted controls. Repetition disfluencies, where pre- and post-interruption portions might easily be construed together, allowed more successful word recognitions than recast disfluencies, where reconstruction of a single intended utterance would be difficult, if not impossible. The results have implications both for understanding human speech recognition and for improving the robustness of ASR systems.


Similar resources

Recognizing emotions in dialogue with disfluences and non-verbal vocalisations

We investigate the usefulness of DISfluencies and Non-verbal Vocalisations (DIS-NV) for recognizing human emotions in dialogues. The proposed features measure filled pauses, fillers, stutters, laughter, and breath in utterances. The predictiveness of DIS-NV features is compared with lexical features and state-of-the-art low-level acoustic features. Our experimental results show that using DIS-NV...


Phone Elasticity in Disfluent Contexts

Disfluencies in speech are instances of hesitation or correction that affect both the speaker and the listener. Typical surface forms of disfluencies are filled pauses (such as uh, uhm), silences at places in the utterance where syntax would not predict them, or repetitions of parts of the utterance (like (I mean + I mean) we should go). Disfluencies carry a folk notion of erroneousness or badn...


Micro-structure of disfluencies: basics for conversational speech synthesis

Incremental dialogue systems can produce fast responses and can interact in a human-like fashion. However, these systems occasionally produce erroneous material or run out of things to say. Humans in such situations use disfluencies to remedy their ongoing production and signal this to the listener. We devised a new model for inserting disfluencies into synthesis and evaluated this approach in ...


Disfluent but effective? A quantitative study of disfluencies and conversational moves in team discourse

Situated dialogue systems that interact with humans as part of a team (e.g., robot teammates) need to be able to use information from communication channels to gauge the coordination level and effectiveness of the team. Currently, the feasibility of this end goal is limited by several gaps in both the empirical and computational literature. The purpose of this paper is to address those gaps in ...


Recognizing Emotions in Dialogues with Disfluencies and Non-verbal Vocalisations

We investigate the usefulness of DISfluencies and Non-verbal Vocalisations (DIS-NV) for recognizing human emotions in dialogues. The proposed features measure filled pauses, fillers, stutters, laughter, and breath in utterances. The predictiveness of DIS-NV features is compared with lexical features and state-of-the-art low-level acoustic features. Our experimental results show that using DIS-NV...


Deriving a strategy for synthesizing lengthening disfluencies based on spontaneous conversational speech data

Our overarching research project explores the usability of disfluencies in incremental spoken dialogue systems. This endeavor requires basic phonetic research on disfluencies in spontaneous speech corpora as to define strategies for synthesizing disfluencies in a meaningful way. In this paper, our current research focus lies in an investigation of disfluency-related lengthening as a promising t...



Publication date: 1996